Load datasets

Here we load each dataset (wti, wtc, koi, koc) separately and do NOT integrate them.

We will run a series of quality-control steps in the code that follows.

Doublet score evaluation

Doublet algorithm parameter selection

We found the following pKs for doublet score evaluation:

Note that the KOC sample has a pK peak at 0.12, but larger pK values tend to produce false positives, so we chose the local maximum at 0.02 instead.
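The selection rule described above (prefer a local maximum at a small pK over a higher peak at a large pK) can be sketched as follows. This is a minimal illustration, not the code used in this analysis: the function names, the `max_pk` cutoff of 0.1, and the toy score values are all assumptions made for the example.

```python
def local_maxima(pks, scores):
    """Return (pK, score) pairs whose score is strictly higher than both neighbours."""
    peaks = []
    for i in range(1, len(scores) - 1):
        if scores[i] > scores[i - 1] and scores[i] > scores[i + 1]:
            peaks.append((pks[i], scores[i]))
    return peaks


def choose_pk(pks, scores, max_pk=0.1):
    """Pick the highest-scoring local maximum whose pK does not exceed max_pk.

    max_pk is a hypothetical cutoff standing in for "larger pK values
    tend to produce false positives"; it is not a value from the analysis.
    """
    peaks = [p for p in local_maxima(pks, scores) if p[0] <= max_pk]
    if not peaks:
        raise ValueError("no local maximum at or below max_pk")
    return max(peaks, key=lambda p: p[1])[0]


# Toy values for illustration only (NOT the real sweep results):
# a modest peak at pK = 0.02 and a taller peak at pK = 0.12.
toy_pks = [0.01, 0.02, 0.03, 0.06, 0.09, 0.12, 0.15]
toy_scores = [100.0, 400.0, 150.0, 120.0, 300.0, 900.0, 500.0]

toy_peaks = local_maxima(toy_pks, toy_scores)
chosen_pk = choose_pk(toy_pks, toy_scores)
print(toy_peaks, chosen_pk)
```

With these toy values the global peak sits at 0.12, but the cutoff excludes it and the smaller peak at 0.02 is selected, which mirrors the choice made for the KOC sample.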

Doublet rate estimation

According to 10x Genomics' data, the doublet rate is roughly 1% per 1,000 cells. We hereby estimate our doublet rates as follows:
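As a rough sketch of this rule of thumb, the expected doublet rate scales linearly with the number of cells, and the expected number of doublets is that rate multiplied by the cell count. The function names and the example cell count below are hypothetical and are not taken from our samples.

```python
def expected_doublet_rate(n_cells, rate_per_1k=0.01):
    """Estimated doublet rate, assuming about 1% per 1,000 cells."""
    return n_cells * rate_per_1k / 1000


def expected_doublets(n_cells, rate_per_1k=0.01):
    """Expected number of doublets: estimated rate times number of cells."""
    return round(expected_doublet_rate(n_cells, rate_per_1k) * n_cells)


# Hypothetical cell count for illustration only (not one of our samples).
example_cells = 5000
example_rate = expected_doublet_rate(example_cells)
example_n_doublets = expected_doublets(example_cells)
print(f"{example_cells} cells: rate ~{example_rate:.1%}, ~{example_n_doublets} doublets")
```

Under this assumption a sample of 5,000 cells would have an estimated doublet rate of about 5%, or roughly 250 expected doublets.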

Compute doublet scores

Display doublet scores

Load the integrated dataset and add the doublet information